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Abstract 

The receptor for advanced glycosylation end products (RAGE) is a multiligand receptor involved in diverse cell signaling pathways. 
Previous studies show that this gene expresses several splice variants in human, mouse, and dog. Alternative splicing (AS) plays an 
important role in expanding transcriptomic and proteomic diversity, and it has been related to disease. AS is also one of the main 
evolutionary mechanisms in mammalian genomes. However, limited information is available regarding the AS of RAGE in a wide 
context of mammalian tissues. In this study, we examined in detail the different RAGE mRNAs generated by AS from six mammals, 
including two primates (human and monkey), two artiodactyla (cow and pig), and two rodentia (mouse and rat) in 6-1 8 different 
tissues including fetal, adult, and tumor. By nested reverse transcription-polymerase chain reaction (RT-PCR) we identified a high 
number of splice variants including noncoding transcripts and predicted coding ones with different potential protein modifications 
affecting mainly the transmembrane and ligand-binding domains that could influence their biological function. However, analysis of 
RNA-seq data enabled detecting only the most abundant splice variants. More than 80% of the detected RT-PCR variants (87 of 101 
transcripts) are novel (different exon/intron structure to the previously described ones), and interestingly, 20-60% of the total 
transcripts (depending on the species) are noncoding ones that present tissue specificity. Our results suggest that RAGE undergoes 
extensive AS in mammals, with different expression patterns among adult, fetal, and tumor tissues. Moreover, most splice variants 
seem to be species specific, especially the noncoding variants, with only two (canonical human Tvl -RAGE, and human N-truncated or 
Tvl 0-RAGE) conserved among the six different species. This could indicate a special evolution pattern of this gene at mRNA level. 
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Introduction 

Nowadays, it is widely known that alternative splicing (AS) 
plays an important role in expanding transcriptomic diversity 
(Blencowe 2006; Nilsen and Graveley 2010; Mercer et al. 
2011), as well as in the generation of proteomic variety, 
which has considerable biological consequences (Xing and 
Lee 2007; Colak et al. 2013). Recent studies have shown 
that in addition to proteins, RNA structure is also involved in 
regulating AS (McManus and Graveley 2011; Yang et al. 
201 1). Both mechanisms as a whole have tissue- or species- 
specific functions (Pan et al. 2005; Nurtdinov et al. 2007; 
Calarco et al. 201 1) and have implications in the generation 
of developmental stages (Blencowe 2006; Xing and Lee 2007; 



Gingeras 2007; Licatalosi and Darnell 2009). Moreover, AS 
seems to be involved in several diseases (Villate et al. 2008; 
Irimia and Blencowe 2012; Colak et al. 2013), because it is a 
very complex process with several specific splicing site 
sequences and many RNA-binding proteins, where mutations 
or misregulation may cause disease (Cooper et al. 2009). 

It has also been demonstrated that AS is one of the main 
evolutionary mechanisms in mammalian genomes (Nurtdinov 
et al. 2007; Merkin et al. 2012; Barbosa-Morais et al. 2012). 
Diversity in mammalian transcripts seems to be a consequence 
of species-specific AS, mainly related to exons conservation 
among species (Pan et al. 2005). Furthermore, evidences 
of functional selective pressure for AS events have been 
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previously reported (Xing and Lee 2005). However, because 
the canonical splice variants carry out critical gene activities, 
the nninority ones may not often have a major impact on 
physiology (Xing and Lee 2005; Blencowe 2006). Therefore, 
alternative spliced exons that belong to minority isoforms or 
species-specific isoforms seem to evolve more rapidly than 
constitutive ones (Xing and Lee 2005). AS has been described 
to play an important role in immune system in swine models 
(Lynch 2004; Sinkora et al. 2005), and in signal transduction 
and susceptibility to diseases in relation with the evolution of 
primate species-specific characteristics (Calarco et al. 2007). It 
should be taken into account that the majority of these kinds 
of analyses have been based on predictive computational 
methods and/or in cDNA sequences deposited in different 
databases (Pan et al. 2005; Xing and Lee 2005; Nurtdinov 
et al. 2007). 

In this study, we focus on the receptor for advanced gly- 
cosylation end products (RAGE) as a complex spliced gene 
in mammals (Yonekura et al. 2003; Hudson et al. 2008; 
Kalea et al. 2009; Sterenczak et al. 2009). RAGE (also 
termed ACER) is a member of the immunoglobulin (Ig) 
superfamily and a cell surface receptor for a broad variety of 
ligands including advanced glycation end products (Neeper 
et al. 1992), members of the SlOO/calgranulin superfamily 
(Hofmann et al. 1999), amphoterin, also known as HMGB1 
(Hori et al. 1 995), an amyloidogenic form of serum amyloid A 
and beta amyloid (Ap) peptide (Yan et al. 1996). Binding of 
these ligands to RAGE entails the activation of diverse signal 
transduction pathways implicated in cell growth and prolifer- 
ation (Huttunen et al. 1999; Kislinger et al. 1999). The dereg- 
ulation of this process seems to be related with several 
diseases (Kalea et al. 2011) including diabetes (Goh 
and Cooper 2008), immune and inflammatory disorders in 
arteriosclerosis (Harja et al. 2008), Alzheimer's disease 
(Ding and Keller 2005a, 2005b; Emanuele et al. 2005), and 
invasiveness in various cancers (Hiwatashi et al. 2008; 
Ghavami et al. 2008). 

Several splice variants of RAGE have been previously 
described in human (Malherbe et al. 1999; Yonekura et al. 
2003; Ding and Keller 2005b; Hudson et al. 2008), murine 
(Kalea et al. 2009), and canine tissues (Sterenczak et al. 2009, 
2013). RAGE is composed of three extracellular Ig-like do- 
mains: an Ig V-set domain (Ig-V domain) and two Ig C-set 
domain (Ig-C domain); and a single transmembrane region 
with a short cytoplasmic tail. Splice variants result in changes 
that might affect the extracellular ligand-binding domain of 
RAGE by different modifications of the region encoding the 
Ig-V domain as well as the removal of the sequence encoding 
the transmembrane region allowing to generate secreted 
RAGE (sRAGE) isoforms (Hudson et al. 2008). 

Here we report a deep description of RAGE splice variants 
based on experimental data and compared with RNA-seq data 
obtained from databases. We have identified by nested 
reverse transcription-polymerase chain reaction (RT-PCR) 



numerous novel splice variants of RAGE in 6-18 tissues of 
six different mammals, and we have performed a wide com- 
parison among them. Furthermore, we have performed a de- 
tailed analysis of high-throughput RNA sequences (RNA-seq) 
from brain and liver samples of five of the six analyzed mam- 
mals, which data were obtained from Pan et al. (2008), 
Barbosa-Morais et al. (2012), Merkin et al. (2012), and 
ENCODE Project. This is the first comparison of RAGE splice 
variants among several species within the orders primates, 
artiodactyla, and rodentia, and our results suggest a special 
evolution pattern of this gene at the mRNA level. Additionally, 
we analyzed the relative expression levels of the different 
splicing variants in the studied tissues and species, as well 
as the relative abundance of coding and noncoding tran- 
scripts showing how the noncoding ones are extensively 
represented. 

Materials and Methods 

Analyzed Tissues 

For the screening of RAGE splice variants, we analyzed mRNAs 
from six mammalian species corresponding to different tissues, 
including adult, embryonic, and cancer tissues, that were ob- 
tained from BioChain through one of their European distribu- 
tor "AMS Biotechnology" as follows: human adult (blood, 
brain, kidney, liver, lung, pancreas, spleen, and thymus), 
fetal (brain, heart, kidney, liver, lung, spleen, and thymus), 
and human cancer tissues (kidney, liver, and lung); rhesus 
monkey adult tissues (brain, heart, kidney, liver, lung, and pan- 
creas); bovine and swine adult tissues (brain, heart, kidney, 
liver, lung, and spleen); mouse adult (brain, heart, kidney, 
liver, lung, and pancreas) and mouse embryonic tissues (1 1, 
15, and 17 days of development); and rat adult (brain, heart, 
kidney, liver, lung, pancreas, and spleen) and rat embryonic 
tissues (1 3 days of development) (for tissue details see supple- 
mentary table SI, Supplementary Matenal online). 

Splice Variants Identification 

One microgram of total RNA from each tissue was used for 
oligo-dT primed cDNA synthesis, which was performed using 
the ImProm II Reverse Transcription System (Promega) follow- 
ing the manufacturer's instructions. All cDNAs were checked 
by amplification of the housekeeping beta-actin gene expres- 
sion as proof of reliable cDNA template, with the following 
foHA/ard and reverse primers: Actin_F 5'CTTCGCGGGCGACG 
ATGC3' and Actin_R 5'TGGTGGTGAAGCTGTAGCC3'. 
The splicing isoforms were amplified by nested PCR from 
the different tissues cDNAs with GoTaq Green Master Mix 
(Promega). The PCR conditions were as follows: 95 °C for 
5min followed by 30 cycles of 95 °C for 1 min, 60 °C for 
2min, 72 °C for 2 min, and ending by 72 °C for 5 min. 
The first round was performed with 1 |il of cDNA in a 20 |il 
reaction volume and with external forward and reverse 
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primers (extF and extR) as follows: H. sapiens and M. mulatta 
(RAGE_extF 5'CAGGACCCTGGAAGGAAGCAGG3' and 
RAGE_extR 5'CTGGTTGTAGAAGAAAGCTTGGC3'), B. taurus 
(BtRAGE_extF 5'AGCCTGGGAAGGAAGCAGG3' and 
BtRAGE_extR B'GGTTGTGGAAGAAAGTTTGGGB'), S. scrota 
(SsRAGE_extF 5'AGCCTGGGAAGGAAGCAGG3' and 
SsRAGE_extR 5'GGGTGAACTGGTCTGGGGCC3'), and R. nor- 
vegicus and Mu. musculus (RnRAGE_extF 5'AGCCTGGGAAG 
GAAGCACC3' and RnRAGE_GxtR 5'ATCATGTGGGCTCTGGTT 
GG3'). Primers were designed to amplify from start to stop 
codon of tlie canonical full-length RAGE gene (Hudson et al. 
2008). For the second round, we used primers just down- 
stream of the first round ones called internal forward and 
reverse primers (intF and intR), as follows: H. sapiens and 
M. mulatta (RAGEJntF 5'AGCCGGAACAGCAGTTGG 
AGCC3' and RAGEJntR 5'GCCCTCCAGTACTACTCT 
CGCC3'), B. taurus (BtRAGEJntF 5'GGGGCAGTGGTCGGAG 
CC3' and BtRAGEJntR 5'CTCCTGTGCTGCTCTCTGCC3'), 
5. scrota (SsRAGEJntF 5'GCAGCAGGGACAACGGCC3' and 
SsRAGEJntR 5'GCTCCTGCACTGCTCTCAGCC3'), and R nor- 
vegicus and /Wu. musculus (RnRAGE_intF 5'ACAGCAGCTAGA 
GCCTGGG3' and RnRAGE_intR 5'CCCGGCACCATTCTCTGG 
C3'). The second round was performed with 1 |il of a 1:10 
dilution of the first round product as template on a final 
volume of 20|il and only 20 cycles. To verify product size 
and experimental reproducibility, two independent PGR exper- 
iments were performed and loaded on a 2% agarose gel. The 
splicing isoforms products were gel purified as single bands, 
except from human and rhesus monkey where 1-1. 5 kb 
bands were extracted at the same time, using the Wizard 
SV Gel and PGR Clean up System (Promega), and then 
cloned into the pGEM-T Easy vector (Promega). The resultant 
plasmids were purified by Wizard Plus SV Minipreps DNA 
Purification Systems (Promega) and sequenced using 17 (5'T 
AATACGACTCACTATAGGG3') and Sp6 (5'GATTTAGGTGACA 
CTATAGAATAC3') primers by the Genomics Unit at the 
Parque Cientifico de Madrid (http://vvww.fpcm.es/, last 
accessed December 2, 2013). Relative abundance of each 
splice variant was measured as the result of the gel intensity 
for each single band using ImageJ software (http://rsb.info.nih. 
gov/ij/index.html, last accessed December 2, 201 3). In the case 
of 1-1 .5 kb bands, their relative abundance was related to the 
number of colonies found from the 48 total colonies checked 
in each tissue and measure as follows: lower than 20% of the 
total colonies was considered as low expression, 20-40% as 
moderate expression, 40-50% as moderate-high expression, 
and 50% of the total colonies or higher was considered as 
high expression levels. 

Bioinformatics 
General Analysis 

Sequence data were annotated using the BioEdit software 
(http://vvww.mbio.ncsu.edu/BioEdit/bioedit.html, last accessed 



December 2, 2013) and aligned to Tvl-RAGE mRNA (cDNA) 
and to genomic RAGE DNA in each species using EBI Pairwise 
Alignment (http:/AAAAAA/.ebi.ac.uk/Tools/psa/, last accessed 
December 2, 2013) to obtain the exonic composition of 
each isoform and to compare it to the canonical one. The 
translation of each splice variant to protein was performed 
using ORE Finder (www.ncbi. nlm.nih.gov/gorf/, last accessed 
December 2, 2013). Pfam (http://pfam.sanger.ac.uk/, last 
accessed December 2, 2013) was used for general domain 
prediction and SignalP4.0 Server (http://vvvvw.cbs.dtu.dk/ser- 
vices/SignalP/, last accessed December 2, 201 3) for signal pep- 
tide prediction. Splice variants with less than one protein 
domain and one or more premature stop codons (PSC) 
were considered as noncoding RNA (ncRNA) variants. mRNA 
splice variants were named as Sp_RAGE_VN (where species 
was abbreviated by Gsss = first letter of genus and the three 
first letters of the species name, VN = mRNA variant number). 
The sequence data from this study have been submitted to 
GenBank with the accession numbers: H. sapiens 
(GUI 39370-84), M. mulatta (GUI 39408-22), B. taurus 
(GUI 39385-407), S. scrota (GUI 64706-1 4), Mu. musculus 
(GU164727-39) and R. norvegicus (GU164715-26) (For de- 
tails see supplementary fig. SI, Supplementary Material 
online). 

RNA-Seq Data Analyses 

RNA-seq data from brain and liver tissues of different species 
were obtained from the National Center for Biotechnology 
Information (NCBI) Sequence Read Archive (SRA) database, 
single reads of SRP007412 (Barbosa-Morais et al. 2012) 
and paired reads of SRP016501 (Merkin et al. 2012). Paired 
reads of human Monocytes-CD14H- from ENCODE project 
(SRX1 79434) were also analyzed. Reads were mapped to 
the genome of each species, obtained from the EnsembI 
Database {H. sapiens: GRCh37, M. mulatta: MMUL_1, 
B. taurus: UMD3.1, Mu. musculus: GRCm38, R. norvegicus: 
Rnor_5.0), using the default parameters of TopHat and 
Bowtie (Trapnell et al. 2009; Langmead et al. 2009). 
Junction reads of the BED files (TopHat outfile) were normal- 
ized (per million of mapped reads) to compare splice events 
among samples. Samtools software (Li et al. 2009) was used 
for indexing BAM files (TopHat outfile), conversing alignment 
formats, and sorting the gene region of interest. The resulting 
RAGE-aligned reads were used to assemble them into 
transcripts using Cufflinks (Trapnell et al. 2010) by both de- 
fault (-F 0.1 and -j 0.15) and modified (-F 0 and -j 0.5) param- 
eters. Additionally, the experimentally described transcripts 
were provided to Cufflinks by a GTF file, and their relative 
abundances were estimated. The expression of each transcript 
was measured in Fragments Per Kilobase of exon model per 
Million mapped fragments. The IGV browser (Robinson et al. 
201 1 ) was used for visualization of mapped reads and assem- 
bled transcripts to its genome context. 
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Results 

Primates RAGE Splice Variants 

Identification and Characterization of IHuman RAGE Splice 
Variants 

To fully characterize H. sapiens RAGE splice variants, we per- 
formed specific nested RT-PCR assays in 18 different tissues. 
Detailed analyses of these results showed a total of 22 differ- 
ent variants obtained from eight adult, seven fetal, and three 
tumoral tissues (fig. 1). Interestingly, 15 of these isoforms 
were novel RAGE splice variants (different exon/intron struc- 
ture to the previously described ones) named HsapRAGEvI to 
v15 (detailed information regarding these novel variants and 
the previously described ones is available in supplementary fig. 
SI, Supplementary Material online). Transcript variant 1 (also 
known as full-length RAGE, and hereafter Tvl-RAGE) was 
identified in this study as one of the most abundant splice 
variants (fig. 1), in agreement to previous descriptions 
(Hudson et al. 2008). This isoform is composed by a signal 
peptide encoded within exon 1 , an Ig-V domain within exons 
2 and 3, two Ig-C domains within exons 4-6 and 7-8, respec- 
tively, and a single transmembrane region within exon 10 
followed by a short cytoplasmic tail within exon 1 1 (Hudson 
etal. 2008) (fig. 1). 

Other found alternative splice variants show changes in 
these domains. For instance, intron 1 retention in TvlO- 
RAGE (previously named N-truncated) and HsapRAGEvI to 
v8 splice variants results in a PSC, but there is an alternative 
downstream potential start codon located at exon 3 exclud- 
ing the signal peptide (which could affect to the function- 
ality of the protein) and the Ig-V domain from their 
potential encoded isoforms (fig. 1). TvlO-RAGE was recently 
defined as a noncoding variant by NCBI, although several 
authors consider it as a coding isoform that lacks the Ig-V 
domain (Yonekura et al. 2003; Sterenczak et al. 2013). 
TvlO-RAGE variant only presents this splice event, whereas 
WsapRAGEvI to v8 present a combination of other addi- 
tional AS events (fig. 1). Another common splicing event is 
the inclusion of part of intron 9 and skipping of exon 10 
(Tv6-RAGE or C-truncated, Tv4-RAGE, HsapRAGEvI to v6, 
v8, and v9 splice variants) that results in the loss of the 
transmembrane region and cytoplasmic tail and, therefore, 
generating potential sRAGE protein isoforms. We have also 
found other different splicing events, such as the use of a 
different 5' -donor splice site in exon 4 (Tv2-RAGE and Tv4- 
RAGE, previously described by Hudson et al. 2008), that 
would cause variations in the region encoding the 
corresponding Ig-C domain and therefore, this could 
modify the ligand specificity and/or the binding affinity. In 
addition, we found several predicted noncoding short var- 
iants (RAGE_v11, HsapRAGEvI 0 to v15 splice variants), 
which present skipping of several exons and use of differ- 
ent acceptor splice sites (fig. 1). 



We have also assessed the relative abundance of each 
splice variant showing differences in expression levels 
among all the analyzed tissues. Tvl-RAGE, Tv2-RAGE, and 
TvlO-RAGE variants were the most prevalent of the detected 
transcripts in the majority of the analyzed tissues. Moreover, 
large splice variants with different intron retentions show low 
but uniform expression among the different tissues. By con- 
trast, small splice variants also show low expression levels but 
with a very specific distribution on particular tissues, even 
among embryonic or tumoral states within the equivalent 
tissue. All the analyzed tissues show this similar pattern of 
expression levels, but interestingly we have observed that 
adult and fetal lung only express Tvl-RAGE and Tv6-RAGE 
(C-truncated) variants at high levels, whereas tumor lung 
expresses all the different large splice variants but at low 
levels. Nearly 40% of the total analyzed variants in human 
are noncoding (fig. 2). 

Identification and Characterization of Rhesus Macaque 
RAGE Splice Variants 

In M. mulatta, we detected in total 1 5 novel variants of RAGE 
within six different adult tissues (fig. 3), and one of them 
was the predicted canonical Tvl-RAGE (NM_0012051 17- 
GU139417). We have named these splice variants as 
Mmu/RAGEvI to v15. Only Tvl-RAGE was previously anno- 
tated on databases based on homology to human, but we 
show here the first biological evidence for this isoform. 
/Wmu/RAGEvS and Tvl-RAGE from M. mulatta are directly 
related to the exon/intron structure of TvlO-RAGE and Tvl- 
RAGE human splice variants, respectively. Furthermore, 
we found an isoform (/WmL//RAGEv3) that retains introns 1 
and 6 and is related to an isoform (NtRAGEA) previously 
described in humans (supplementary fig. SI, Supplementary 
Material online) and not detected by us, involved in 
Alzheimer's disease (Ding and Keller 2005a, 2005b). None 
of the remainder macaque variants have a human homolo- 
gous variant (table 1). 

Inclusion of intron 1 is shown in Mmu/RAGEvI to v5, in 
which /Wmtv/RAGEvI is predicted as ncRNAs due to the pres- 
ence of different PSC in introns 1 and 6 or exon 7. The whole 
inclusion of intron 6 in /Wmu/RAGEv6 and its partial inclusion 
in /V/mL//RAGEv9 resulted in the appearance of PSCs. The 
potential encoded isoforms of these two variants could 
be new sRAGE isoforms lacking also the last IgC domain 
(fig. 3). We found changes in the amino acid sequence of 
the Ig-V domain in Mmu/RAGEv1 1 which turns it into an 
Ig-C domain as a result of the use of a 5'-donor alternative 
splice site in exon 3. Moreover, an alternative 3'-acceptor 
splice site in exon 2 detected in MmulRAG£\/8 transcript 
implies changes in the reading phase and, due to that a PSC 
appears, although an isoform could be generated from a 
downstream start codon in exon 3 (fig. 3). In addition, 
we found four smaller isoforms (MmL;/RAGEv12 to v15). 
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Fig. 1. — RAGE transcript variants in Homo sapiens. Representation of human RAGE splice variants and their relative expression levels in different 
adult (A), fetal (F), and tumor (T) tissues. Potential protein domains are shown with different colors, and noncoding sequences are shown in white. 
Junctions from the canonical variant are shown as straight lines, whereas the alternative junctions are shown as open triangles. Splice variants are arranged 
by length. 



which present skipping of large number of exons, one of them 
could be a potential coding variant (Mmtv/RAGEv12), whereas 
the other three are predicted as ncRNA. Noncoding variants 
form around 30% of the total monkey RAGE isoforms (fig. 2). 

Macaque tissues show different expression levels of each 
variant (fig. 3). Tvl-RAGE, Mmu/RAGEvS, Mmu/RGEv5, and 
/Wmc//RAGEv3 were the most abundant variants among 



different tissues. /Wmu/RAGEv8 is overrepresented in brain, 
whereas macaque Tvl-RAGE is overrepresented in lung 
tissue, as it happens with its homologous human variant. As 
Mmu/RAGEvS lacks the IgV domain in relation to Tvl-RAGE, 
the overexpression in brain could indicate that Mmu/RAGEvS 
is probably binding to another ligand. However, we have to 
consider that it would not encode a signal peptide. Small 
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Fig. 2. — Relative abundance of coding versus noncoding transcript 
analyzed. Stacked percentage bars reflect the portion of previously 
described variants and the variants that we found in our study (coding 
and noncoding) present in different analyzed species. 



noncoding variants show low expression levels and are only 
present in lung tissue, with the exception of the potential 
coding /Wmu/RAGEv12 that it is only expressed in liver. 

Artiodactyla RAGE Splice Variants 

Identification and Characterization of Bovine RAGE Splice 
Variants 

In 6. taurus we have detected a total of 24 alternative splice 
variants of RAGE within six different adult tissues (fig. 4), 23 
of them were novel alternative splice variants (named 
as BfauRAGEvI to v23), Tvl-RAGE being the only one previ- 
ously described by Neeper et al. (1 992) (NM_1 73982) (supple- 
mentary fig. SI, Supplementary Material online). Tvl-RAGE 
and SfauRAGEv? isoforms are related structurally to the 
similar human and macaque variants (table 1). However, 
6fauRAGEv7 splice variant leads to several changes at protein 
level when compared with the primates counterparts, due to 
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Fig. 3. — RAGE transcript variants in Macaca mulatta. Representation of monkey RAGE splice variants and their relative expression levels in different adult 
tissues. See figure 1 for details. Mmu/RAGEv5 (bold) is related to the human TvlO-RAGE. 
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Table 1 

Conserved RAGE Variants Based on Exon/lntron Composition across Mammals 
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an alternative downstream start codon located at exon 2, 
instead of exon 3. Only BteuRAGEvI and fifauRAGEv2 could 
generate N-truncated proteins similar to the human one but, 
in this case, due to the retention of introns 1 and 3 or introns 1 
and 2, respectively. 6teuRAGEv22 is similar to HsapRAGEvM, 
being predicted as ncRNAs, and 6teuRAGEv3 and v6 have 
homologs in macaque (table 1). Furthermore, sRAGE isoforms 
could be generated by exon 8 skipping (BfauRAGEv9, vIO, 
and v13), using alternative donor sites (BfauRAGEv12), or by 
intron retention (SfauRAGEv3), but in all these cases lacking 
the second Ig-C domain. In addition, several noncoding splice 
variants are found with large "deletions" (BfauRAGEvM to 
v23), or small ones and intron retentions (SteuRAGEvS 
and v1 1), which represent around 50% of whole transcripts 

(fig. 2). 

In relation to relative abundance of each splice variant, 
figure 4 shows high and wide expression levels of 
SfauRAGEv? splice variant, whereas the rest of splice variants 
show in general low and tissue-specific expression levels. In 
contrast to human, bovine Tvl-RAGE presents low expression 
levels in the lung tissue. 

Identification and Characterization of Swine RAGE Splice 
Variants 

Sus scrota only shows 10 alternative splice variants of RAGE 
within the six different analyzed adult tissues (fig. 5), becom- 
ing the species with fewer variants, even when comparing the 
same number and type of tissues. Databases searches only 
show the predicted Tvl-RAGE isoform, whereas the rest are 
novel alternative splice variants (named SscrRAGEvI to v9). 
Tvl-RAGE and SscrRAGEvS isoforms are again the only two 
variants similar to human splice variants (Tvl-RAGE and TvlO- 
RAGE, respectively, table 1), showing the same domains as 
the humans counterparts. Retentions of intron 1 and/or 3 are 
common events in swine RAGE (SscrRAGEvI to v7 and v9). 
These events result in a PSC and a downstream start codon 



located at exon 4, instead of exon 3 in primates or in exon 2 in 
B. taurus, which could encode proteins that lack the Ig-V 
domain and the signal peptide. SscrRAGEvI isoform is related 
to bovine BfauRAGEvI isoform with similar protein domains 
(table 1). Furthermore, soluble isoforms could also be present 
in swine RAGE, as a result of the use of an alternative 
3'-acceptor splice site in exon 10 (SscrRAGEv? and v8) and 
therefore lacking the transmembrane region. On the other 
hand, swine RAGE only shows two predicted noncoding 
splice variants (SscrRAGEv2 and v9), representing 20% of all 
variants detected (fig. 2). 

Expression levels of the different splice variants in swine 
(fig. 5) are quite different compared with human ones. In 
swine, SscrRAGEv5 isoform, which is equivalent to human 
TvlO-RAGE, is also highly and widely expressed in all the 
analyzed tissues, even in lung tissue where the human homo- 
log is not expressed. However, Tvl-RAGE expression is mod- 
erate and only in two tissues (kidney and spleen), remarkably, 
there is no Tvl-RAGE expression in lung (fig. 5). 

Rodents RAGE Splice Variants 

Identification and Characterization of Mouse RAGE Splice 
Variants 

We detected a total of 17 alternative splice variants of RAGE 
from Mu. musculus within different adult and embryonic tis- 
sues (fig. 6). Tvl-RAGE was already in databases, and 
mRAGE_v2, v4, and v5 were previously described (Kalea 
et al. 2009) (supplementary fig. SI, Supplementary Material 
online). The other 13 splice variants are novel and named as 
MmtvsRAGEvI to v1 3 (fig. 6). Only two of these variants, Tvl - 
RAGE and mRAGE_v2 are related, in terms of exon/intron 
structure, to the equivalent splice variants in other analyzed 
mammalian species (table 1). However, mRAGE_v2 presents 
another PSC in exon 5, not present in the other homologs, 
and as a result, mRAGE_v2 is a noncoding variant (fig. 6). 
There is a potential soluble murine splice variant 
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Fig. 4. — RAGE transcript variants in 60s taurus. Representation of bovine RAGE splice variants and their relative expression levels in different adult tissues. 
See figure 1 for details. SfauRAGEv? (bold) is related to the human TvlO-RAGE. 



(MmusRAGEv5) related to a swine one (SscaRAGEvS) (table 1). 
In addition, other potential soluble variants containing the 
signal peptide and the first two Ig domains could be generated 
(/WmusRAGEv2 to v6) (fig. 6), and three by the presence of 
stop codons before exon 10 (/WmusRAGEv2, v3 and v5). 
Harashima et al. (2006) had previously described an isofornn 



(esRAGE) that could generate soluble proteins (by intron 9 
retention), but we have not been able to identify it (supple- 
mentar/ fig. SI, Supplementary Material online). Another 
event that could produce domain alterations is present in 
/WmusRAGEvI due to the use of an alternative 3'-acceptor 
splice site in exon 3 and, therefore, removing the Ig-V 
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Fig. 5. — RAGE transcript variants in Sus scrofa. Representation of porcine RAGE splice variants and their relative expression levels in different adult 
tissues. See figure 1 for details. Splice variants are arranged by length. Ssc/fiAGEvS (bold) is related to the human TvlO-RAGE. 



domain. We found several noncoding splice variants of which 
nnRAGE_v5 and mRAGE_v2 were previously described, and 
Mmt/sRAGEv? to v1 3 being novel from this study (supplemen- 
tary fig. SI, Supplementary Material online). In total, they 
represent more than 50% of murine splice variants (fig. 2). 

Relative abundance of each splice variant shows differences 
in expression levels among tissues (fig. 6) as well as compared 
with other mammals analyzed in this study. Tvl-RAGE 
presents, as in humans, high and wide expression, either in 
embryonic or adult tissues. Nevertheless, the noncoding 
mRAGE_v2 (equivalent in structure to potential coding 
human TV10-RAGE) shows low expression levels in brain, 
liver, and 1 1 days embryonic tissue, and only shows moderate 
expression levels in pancreas adult tissue, in contrast to the 
human homolog which is highly expressed. Other splice 
variants show low expression levels but with a specific distri- 
bution among tissues and state of development. 

Identification and Characterization of Rat RAGE Splice 
Variants 

We detected a total of 1 3 alternative splice variants of RAGE 
from R. norvegicus within different adult and embryonic 
tissues (fig. 7). Tvl-RAGE sequence was previously found 
based on homology, whereas the other 12 were novel 



alternative splice variants named as RnoARAGEvI to v12 (see 
supplementary fig. SI, Supplementary Material online). 
Tvl-RAGE and /?noARAGEv2 isoforms are related to human 
splice variants, as well as in the other analyzed species 
(table 1). Rat /?norRAGEv2, although related structurally to 
human TVIO-RAGE, shows changes at protein level being 
similar to bovine SfauRAGEv? isoform, with the Ig-V domain 
present (fig. 7). 

Intron 1 retention is present in some splice variants 
(RnorRAGEvI , v2, v4, and v8) resulting in PSC on the retained 
intron, with an alternative downstream start codon located at 
exon 2 that conforms potential coding splice variants, and 
noncoding splice variant when intron 3 is also retained 
(RnorRAGEvI) or due to several exon skipping events 
(RnoARAGEvS). Some rat RAGE splice variants are related to 
murine RAGE splice variants (table 1), of which /?norRAGEv3 
(related to mRAGE_v4) is broadly expressed and quite similar 
in protein structure to TVI-RAGE as it only lacks the short 
region between the second IgC. Noncoding splice variants 
(RnorRAGEvI and /?no/1^AGEv6 to v12) represent more than 
60% of ncRNA in R. norvegicus (fig. 2). 

Relative abundance of each rat splice variant shows differ- 
ences in expression levels among tissues, especially noncoding 
variants that show specific distribution around tissues and 
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Fig. 6. — RAGE transcript variants In Mus musculus. Representation of mouse RAGE splice variants and their relative expression levels in different adult 
and embryonic tissues. See figure 1 for details. mRAGE_v2 (bold) is related to the human TvlO-RAGE. 



ennbryonic developmental stages. Tvl-RAGE presents the 
highest expression levels in all tissues, with the exception of 
the liver where it is not expressed. Lung tissue shows similar 
expression levels to human ones, with high levels of Tvl -RAGE 
and low levels of /?norRAGEv3 and smaller noncoding 
isoforms. RnofRAGEvI and /?norRAGEv2 variants show 
moderate and moderate-high levels in adult tissues, the last 
one in agreement with the human homolog TV10-RAGE. In 
embryonic tissue, Tvl-RAGE and RnorRAGEvS are the most 
abundant transcripts (fig. 7). 

Identification of RAGE Splice Variants by 
High-Throughput Technologies 

Nowadays, RNA-seq data are a powerful tool to perform 
transcriptional studies, from description of new isoforms to 



quantification of all transcripts. For this reason, we decided 
to carry out high-throughput RNA analyses of sequencing 
data deposited in the SRA database (Pan et al. 2008; 
Barbosa-Morais et al. 2012; Merkin et al. 2012) to compare 
them with our nested RT-PCR results. We mapped these pub- 
lically available RNA-seq reads from brain and liver tissues 
to each genome of reference (human, monkey, bovine, rat, 
and mouse), and afterward, relative abundances of RT-PCR 
detected RAGE transcripts were estimated using Cufflinks. 
The analyses confirmed that RAGE is a very lowly expressed 
gene due to a minimum or null coverage (supplementary table 
S2, Supplementary Material online). In human, Tvl-RAGE 
shows moderate expression in brain and Tv10_RAGE in liver, 
in accordance with our RT-PCR results. In addition, macaque 
/Wmu/RAGEv2 and Mmc//RAGEv6 variants are the mainly 
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Fig. 7. — flA,GE transcript variants in Rattus norvegicus. Representation of rat RAGE splice variants and their relative expression levels in different adult and 
an embryonic tissue. See figure 1 for details. RnorRAGEv2 (bold) is related to tlie human TvlO-RAGE. 



expressed in brain and liver followed, in brain, by Tvl-RAGE, 
wliich is not expressed in liver. These RNA-seq results are not 
in agreement with the RT-PCR ones where /V/mtv/RAGEv8 is 
more expressed in all tissues followed by Tvl-RAGE. In the 
bovine analyses, SfatvRAGEvI , v2, and v4 are highly expressed 
in brain and SfauRAGEvI and v3 in liver (with some correlation 
to our PCR results). Moreover, bovine Tvl-RAGE is moderately 
expressed in brain but not expressed in liver, whereas 
SfauRAGEv7, which is very highly detected by PCR, shows 
no expression in RNA-seq samples. In mouse RNA-seq sam- 
ples, the mainly expressed variants are Tvl-RAGE (as by our 
PCR results) and mRAGE_v5 in both tissues. In addition, we 
observe similar expression results between both technologies 
in Tvl-RAGE, RnorRAGEvI, and v2 rat variants. Nevertheless, 
it is important to observe that all expression levels show 
extremely low values. 

Furthermore, we analyzed data from a human sample 
(monocytes) from the ENCODE project with a very high 
number of reads. Using default parameters, Cufflinks was 
only able to assemble three transcripts (supplementary fig. 



S2, Supplementary Material online) which do not correlate 
with those we found by nested RT-PCR, even the canonical 
transcript could not be assembled. As Cufflinks suppresses the 
output of smaller fractions of splice variants, we approached 
this analysis altering the parameters which modify the minimal 
isoform fraction and the pre-mRNA fraction. As a result, 
we were able to assemble Tvl-RAGE, TvlO-RAGE, and 
f/sapRAGEvS (supplementary fig. S2, Supplementary 
Material online) in the human monocytes sample. We 
extended this approach to the previously analyzed brain and 
liver samples in all the different species. Tvl-RAGE was 
detected in only one brain human sample, and N-truncated- 
related variants were assembled in macaque brain 
(Mmtv/RAGEvB) and both bovine tissues (SfauRAGEv?). 
MmL//RAGEv6 was also assembled in macaque brain, (supple- 
mentary fig. S2, Supplementary Material online). No variants 
could be assembled in the rodent samples. 

Observing that Cufflinks analyses were not able to assem- 
ble all the isoforms detected by RT-PCR, not even the mainly 
expressed ones, we decided to analyze the number of splicing 
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events detected by TopHat, by using the reported junctions. 
We were able to detect most of the canonical splice events but 
only in those samples which have high read coverage. It is 
interesting to notice that the splice event between exons 1 
and 2 is not present in any bovine sample, in accordance to 
our results obtained by RT-PCR. For that reason, we were not 
able to assemble the Tvl-RAGE isoform using Cufflinks, 
and the equivalent to the human N-truncated variant was 
the only isoform assembled in these samples. Moreover, a 
few alternative spliced events were also detected in all the 
species (supplementary fig. S3, Supplementary Material 
online) but all of them with extremely low expression values. 
Some of these events showed values between 0.004 and 
0.008 junction reads per million of mapped reads, which 
correspond to just one or two junctions found in the total 
sample. These events show a similar tissue expression pattern 
as our experimental results. For instance, partial retention of 
intron 6 in rhesus monkey is lowly detected in brain, as the 
variants which contain this splice event (/Wmu/RAGEvB, v7, 
and v9). The same consistent results are shown in the skipping 
of exon 9 event in rat samples, with low levels in brain 
observed by both approaches. 

Discussion 

In this study, we have deeply characterized a high number of 
alternative splice variants of RAGE in different mammals and 
tissues, and we have compared their relative expression levels. 
A diversity of functions of RAGE has been previously described 
as a result of AS events (Yonekura et al. 2003; Hudson et al. 
2008; Kalea et al. 2009; Sterenczak et al. 2009), which can 
modify protein domains. Previous studies focused on splice 
variants of RAGE in human (Hudson et al. 2008) and mouse 
(Kalea et al. 2009) showed similar results in expression levels of 
each splice variant. In human, RAGE had only been studied in 
detail in lung and aortic smooth muscle cells (AoSMCs) 
(Hudson et al. 2008). Comparison of adult lung in previous 
(Hudson et al. 2008) and our studies show similar expression 
levels for Tvl-RAGE (80% and high expression levels, respec- 
tively) and different for Tv6-RAGE (C-truncated) (7% and 
moderate expression levels, respectively). Interestingly, in our 
study adult and fetal lung were the only two tissues showing 
these two isoforms, whereas tumor lung presents a high 
number of different splice variants. 

Previous studies related to RAGE splice variants in different 
mouse tissues have been performed (Harashima et al. 2006; 
Kalea et al. 2009). Harashima et al. (2006) identified a soluble 
isoform on brain using nested PCR, and the relative abun- 
dance among other tissues and isoforms was quantified 
using a small PCR (exons 9-11) so these cannot be compared 
with our results. In that work, they also describe a soluble 
isoform (esRAGE) not found in our study, but they generated 
it with a primer in the second round of PCR specific to the 
intron 9 retained. In the study by Kalea et al. (2009), murine 



lung tissue showed both Tvl-RAGE (53%) and mRAGE_v4 
(41%) expressions, and we have found high expression and 
low expression, respectively. The smaller isoforms they found 
are different from those described here, but they also detected 
them at very low expression levels (1 % in each splice variant). 
In general, our results are somewhat consistent with previous 
studies (Hudson et al. 2008; Kalea et al. 2009) as percentages 
higher than 50% are related to our high expression levels, 
50-40% to moderate-high, and 40-20% to low, in most 
cases. Lower than 20% is sometimes not detected in our 
study, and, on the other hand, low-expression variants 
detected by us were not detected by them. In both human 
and mouse previous studies (Hudson et al. 2008; Kalea et al. 
2009), the splice variants were analyzed by RT-PCR and 
restriction enzyme recognition sequences, whereas we sepa- 
rated the RT-PCR product electrophoretically, cloned and ver- 
ified by sequencing. Both approaches use nested PCR, by 
reason of the extremely low expression levels of RAGE, 
which makes it impossible to approach the study in the dy- 
namic PCR range. Therefore, this methodology could slightly 
influence the findings limiting the power to differentiate splice 
variants expression and their relative abundance. For this 
reason, low-expression variants, which are the most difficult 
to detect, sometimes showed differences among these stud- 
ies. Indeed, the optimal approach for determining the relative 
expression of each transcript should be a quantitative RT-PCR 
(qRT-PCR). However, because of the presence of multiple 
splice events common to several isoforms is completely impos- 
sible to design a qRT-PCR assay to discriminate one against 
each other. Nevertheless, this technique was used in canine 
RAGE (Sterenczak et al. 2009) to describe splice variants in 
healthy and tumor tissues, but these analyses were performed 
only with the intron 1 retention event. They detected high 
expression levels of intron 1 retained variants in neoplastic 
samples. On the other hand, intron 1 skipping was appointed 
to healthy transcript variants and, therefore, with lower ex- 
pression levels of the intron 1 retention event in healthy tissues 
(Sterenczak et al. 2009). This assay was performed as a global 
effect of the intron 1, not trying to discriminate among the 
different variants. The wide diversity of intron 1 retention 
splice variants that we found cannot be differentiated against 
each other with that assay and, for this reason, in our study, 
we find splice variants with intron 1 retention that are highly 
expressed in healthy adult and fetal tissues, and intron 1 skip- 
ping splice variants also present in tumor tissues with different 
expression levels. 

Nowadays, the high-throughput sequencing technology is 
being employed to describe the complete transcriptome of 
several organisms (Barbosa-Morais et al. 2012; Merkin et al. 
2012; Rastrojo et al. 2013). This technique provides informa- 
tion on differential expression of genes, posttranscriptional 
mutations or edition, and gene fusions, among others. 
Furthermore, differences in transcript expression levels 
should also be observed, but sufficient coverage is required. 
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In the case of RAGE, both low expression and high transcrip- 
tomic complexity nnade extremely difficult to approach 
the transcriptomic study by this technology. Indeed, in the 
sample used with high coverage (human monocytes, with 
200 millions of paired reads), we were able to detect only 
three of the splice variants detected by nested PCR, and 
only when Cufflinks parameters were altered. For these 
reasons, particular detailed transcriptional studies of low 
expression genes such as RAGE could be misinterpreted by 
high-throughput sequencing technologies, and it would be 
better to study them by other techniques like the one we 
use in this study. 

The transcript variants found and their common splice 
events could have potential regulatory functions about the 
overall role of the Tvl-RAGE isoform, and could be involved 
in the development of several diseases including cancer or 
Alzheimer's disease, among others. For instance, low levels 
of circulating sRAGE have been associated with an increased 
risk of Alzheimer's disease (Emanuele et al. 2005), diabetic 
complications (Grossin et al. 2008), cardiovascular disease 
(Falcone et al. 2005), and hypertension (Geroldi et al. 2005) 
in humans. Removal of the transmembrane region allows 
the generation of sRAGE isoforms by different biological 
processes. These soluble variants (sRAGE) found in humans 
are due to either alternative splicing (esRAGE) or proteolytic 
cleavage (Maillard-Lefebvre et al. 2009) of carboxyl-terminal 
truncation of membrane-bound isoforms (cRAGE) (Hanford 
et al. 2004) by the sheddase ADAM 10 (Raucci et al. 2008). 
Adult rat sRAGE shows high protein expression levels, 2-fold 
higher compared with neonatal rat (Lizotte et al. 2007) 
and then, proteolytic cleavage of the membrane-bound 
isoforms seems to be the main way of sRAGE generation 
(Hanford et al. 2004). To conclude, any of these biological 
events could generate soluble receptors of RAGE which 
could bind to ligands before they find membrane-bound 
receptor RAGE, and thereby attenuate intracellular RAGE 
signaling. We have been able to detect potential soluble 
isoforms in all the analyzed species, although only in human 
(Tv4-RAGE and Tv6-RAGE) and mouse (/WmusRAGEv4) 
they contain the three Ig domains and a signal peptide. 
Interestingly, all species except rat present soluble isoforms 
with only the first two Ig domains, which are the ones involved 
in ligand binding. 

Furthermore, we have found splicing events resulting in 
changes in the extracellular domain that may affect the 
ligand binding domain, by insertion, deletion, or removal of 
partial Ig-V domain of RAGE. Binding of AGEs (Grossin et al. 
2009) or Ap peptides (Ding and Keller 2005a) to the Ig-V 
domain in the membrane-bound receptor RAGE has been 
suggested to contribute to the pathogenesis of diabetic 
vascular complications and Alzheimer's disease, respectively. 
Ig-V domain modification could generate changes in ligand 
affinity of RAGE, and splice variants with these domains mod- 
ifications could be important factors in disease modulation. 



Indeed, there are several variants that lack the signal peptide, 
and hence, their specific intracellular transport and structure 
could be altered. This sort of regulation could modify the final 
location of each isoform and, therefore, its specific function. 
Furthermore, most of these splice variants have a PSC and 
they should be degraded quickly by nonsense-mediated 
decay. However, they seem to be stable, as it has been 
shown by other studies (Hudson et al. 2008; Kalea et al. 
2009) and our case, finding them to be relatively abundant. 
The only two variants that are found in all the analyzed species 
are the equivalent isoforms to human Tvl-RAGE and TvlO- 
RAGE (table 1), which are also the most abundant ones. In 
fact, TvlO-RAGE presents a PSC but is even more expressed 
than Tvl-RAGE in some species (Artiodactyla group). This is 
very interesting from the evolutionary point of view of the 
conservation of the highly and widely expressed TvlO-RAGE 
variant in tissues and species, which can be related to gener- 
ation of a new function. In human, monkey, and pig samples, 
the predicted protein lacks the first IgV domain, whereas in 
cow and rat samples, it would also encode the first IgV 
domain, although it has to be considered that they do not 
contain a signal peptide. In addition, this TvlO-RAGE is a pre- 
dicted noncoding variant in mouse. 

On the other hand, other splice variants of RAGE were 
shown in previous studies, such as dominant-negative RAGE 
(dnRAGE or RAGEA), which only lacks 16 amino acids of the 
intracellular domain (Ding and Keller 2005a). These splice var- 
iants can bind to Ap ligand and thereby also prevent RAGE 
signaling. But neither our study nor previous related studies 
(Yonekura et al. 2003; Hudson et al. 2008; Kalea et al. 2009; 
Sterenczak et al. 2009) have found these dnRAGE variants, 
therefore dnRAGE could be tissue or disease specific. We have 
not detected any other alterations on the cytoplasmic tail in 
any of the analyzed mammals. 

There are also novel transcripts of unknown function, 
which are mainly long and small ncRNAs. Intron retention is 
the common splice event in potential long ncRNAs, whereas 
large exon skipping generates the potential small ones. 
Previous studies related to ncRNAs show noncoding genes, 
such as XIST (Duret et al. 2006) or HOTAIR (Rinn et al. 
2007), but not much is known about coding genes with 
alternative noncoding splice variants, as we show in this 
study. This event could significantly increase the number of 
ncRNAs among wide RNAs. Moreover, these ncRNAs may 
underlie vital intracellular functions, such as transcriptional 
regulation or epigenetics among others (Kikuchi et al. 2009; 
Marques and Ponting 2009; Ponting et al. 2009), being 
generated from individual transcript or multiple rounds of 
cleavage of a unique loci (Tuck and Tollervey 2011). In this 
particular case, the presence of these different noncoding 
variants could be to regulate the full-length RAGE isoform 
and/or its functions in each specific tissue. Because of that, 
long and small noncoding RAGE variants seem to be involved 
in different processes, since long ncRNAs show wide 
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expression among the different tissues and, by contrast, small 
ncRNAs only had low expression and very specific distribution. 
Furthermore, we have also observed that most of the ncRNAs 
detected in our study seem to be species-specific splice vari- 
ants and only a few of them are conserved among the ana- 
lyzed mammals. These nonconserved variants could be due 
to transcriptional noise corresponding to the phenomenon 
described as noisy splicing (Baek and Green 2005; Zhang 
et al. 2007, 2009; Melamud and Moult 2009; Pickrell et al. 
2010) as it is associated to lowly expressed genes and splice 
junctions not conserved in different tissues nor in species. 
Some authors have mentioned the implication of noisy splicing 
as an important property of genome evolution (Pickrell et al. 
2010). Nevertheless, to further characterize this, it would 
require increasing the number of tissues analyzed in each 
species to fully determine the species specificity of each 
splice variant. Either different distribution or species-specific 
noncoding RAGE variants could generate a wide diversity of 
functional roles that remain still unknown. 

Conclusions 

in conclusion, we provide evidence that RAGE undergoes ex- 
tensive AS in the six analyzed mammalian species, generating 
different isoforms which are distributed among tissues (fetal, 
adult, and tumor samples) and species (human, monkey, cow, 
pig, mouse, and rat). No such detailed analysis, which com- 
pares a complex alternative spliced gene in different species 
and tissues, has been previously performed. These transcripts 
have been detected through direct RT-PCR amplifications and 
sequencing of the products but not by RNA-seq using default 
Cufflinks parameters although a very few can be detected 
after altering them. In summary, we can classify these 
transcripts in: highly expressed conserved splice variants; low 
but stable expressed, less or not conserved variants; and lowly 
expressed, in only one or two conditions, species-specific 
variants. Comparison between the splice variants in these 
mammals reveals only two totally conserved ones (human 
TV1-RAGE and TV10-RAGE and their homologs) indicating a 
special evolution pattern at the mRNA level. This is interesting 
from the point of view of different biological functions of one 
particular gene among different species and, in particular, 
when they are used as animal models. Furthermore, a large 
majority of these variants are predicted as noncoding isoforms 
with unknown function, more than previously known. In fact, 
TvlO-RAGE and homologs could be a noncoding variant that 
is highly and broadly expressed. After this analysis, we have 
observed an even more complex and tissue-specific AS 
variants than expected among mammals, including a great 
variability of coding and noncoding variants. For this reason, 
further extensive studies of functionally relevance of each 
splice variant are necessary to understand the overall role of 
RAGE biology, and their possible implication among diseases 
and evolution. Indeed, similar studies should be performed 



with other genes to evaluate whether complex AS patterns 
are common. 

Supplementary Material 

Supplementary figures SI -S3 and tables SI and S2 are avail- 
able at Genome Biology and Evolution online (http;//www. 
gbe.oxfordjournals.org/). 
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